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Abstract 

The classical approach in finance attempts to model the term structure of 
interest rates using specified stochastic processes and the no arbitrage ar- 
gument. Up to now, no universally accepted theory has been obtained for 
the description of experimental data. We have chosen a more phenomeno- 
logical approach. It is based on results obtained some twenty years ago by 
physicists, results which show that Pade Approximants are very suitable 
for approximating large classes of functions in a very precise and coher- 
ent way. In this paper, we have chosen to compare Pade Approximants 
with very low indices with the experimental densities of interest rates vari- 
ations. We have shown that the data published by the Federal Reserve 
System in the United States are very well reproduced with two parameters 
only. These parameters are rather simple functions of the lag and of the 
maturity and are directly related to the moments of the distributions. 
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1 Introduction 



The classical approach in finance attemps to model the term structure of interest 
rates using specified stochastic processes and the no arbitrage argument. Numer- 
ous theoretical models (see for example [|31 



ij) have been proposed. 
Although they provide analytical formulas for the pricing of interest rate deriva- 
tives, the implied deformations of the term structure have a Brownian motion 
component and are often rejected by empirical data p. 

The inadequacies of the Gaussian model for the description of financial time 
series has been reported for a long time [ T9j , [20] but the availability of enormous 
sets of financial data, with a small time scale, has renewed the interest in the 
subject (see [p7| , 



0- 



^5]). In particular, the fat-tail property of the empir- 
ical distributions of price changes has been widely documented and is a crucial 
feature for monitoring the extreme risks [17]. Most of the recent studies 
concern stock indexes or exchange rates [25], [[H|, [p7]l , with high frequency data. 



Studies on interest rates are rather rare and often limited to a few maturities 
(three and six months cash rates in [28], Bund futures in ||). The paper || is 



an exception as the US forward rate curve with maturities up to four years is 
modeled. 

There is a long history of attemps to model price fluctuations with Levy 
distributions |19| . The family of Levy laws exhibits qualitatively heavy tails and 
enjoys the suitable properties of stable laws M, IJT2J. However, they are defined 
in an unduly complicated manner (using the characteristic function) and do not 
even have a finite moment of order two (variance). Truncated Levy distributions 
have been proposed to circumvent this drawback [2^], [^1|]. Other alternative 
distributions have also been proposed (see (T^j, |§, f26fl ). 

Our aim, in this paper, is to propose an alternative description of the empirical 
distributions of the variations of interest rates in a simple form, with a small 
number of parameters but enjoying the property of providing a remarkable fit of 
the full distribution and hence, at the same time, of both the central part and 
the tails. 

Twenty five years ago, physicists have started studying sets of reduced rational 
forms which are particularly well suited to approximate, in a very uniform way, 
functions which belong to rather large classes. These forms are called Pade Ap- 
proximants. Many major progresses in the understanding of these approximants 
have been made at that time. Since there is a very extensive literature concerning 
the validity and the usefulness of these approximants, which are intrinsically re- 
lated to the theory of rational polynomials, to the theory of continuous fractions 
and to the theory of moments, we refer the reader directly to some relevant books 
II] , |3J] where more references can be found. We restrict ourselves, here, to 
a short presentation of the ideas leaving out the fine points. The basic idea of 
the Pade Approximants is to approximate a large class of functions of, say, the 



variable v by the ratio of two polynomials T (y) and B (v ) of respective degrees 
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M and N. 

In this paper, we propose to apply the Pade Approximants to the study of 
daily interest rate changes and show their relevance on data obtained from the 



American bond market from 1977 to 1997 [11 



In Section 2, we discuss the discreteness of the data and their approximation 
by continuous distributions. In Section 3, some general properties of the Pade 
Approximants are outlined. Section 4 contains a justification of the criteria chosen 
in order to assess the quality of the fit. The data are analyzed and our results 
are presented in Section 5. Section 6 contains the conclusions and some final 
remarks. 



2 Interest Rates 

Consider the term structures of interest rates given by samples of N tot daily 
interest rates for constant maturities [m]. Let us call 

/H(t), t = l,...,N tot , (1) 

these spot interest rates where the upper index [m] (in unit year) specifies the 
maturity and t indexes, chronologically, the opening days. 

A short digression on our notation for the "units" of interest rates is useful 
at this point. Interest rates are canonically, like in the data base we used, given 
in "percent /year" (which we will henceforth denote [%/year] and which is usu- 
ally and abusively simply called "percent") with two meaningful digits after the 
decimal point. Hence, the natural unit we will use for interest rate variations 
is the basis point 10 _4 /year which we denote [b. p. /year]. The inverse unit is 
[year/b.p.] will also be used. The daily changes in interest rates are obviously 
also recorded as integers in units [b. p. /year] and vary from roughly —100 to +100. 
for a maturity of one year (this interval may be larger for higher lags). The unit 
[b. p. /year] is the natural unit to perform discrete summations and to expose the 
problem most simply. However, at the end, the parameters entering the problem 
and determined from the data are recorded in the [%/year] (and its inverse unit) 
better adapted to their actual values. 

See Appendix A for a detailed discussion of the dimensions of the variables 
and of the parameters we will use and hence of their behaviour when units are 
changed. 



2.1 Variations of the Interest Rates 

From the interest rates I^ m \t) (integers in units [b. p. /year]), variations of the 
interest rates 51™ (t) at a lag of L days can be defined for each maturity [m] as 

5lt\t) = I [ r\t + L)- I [ r ] (t), t = l,..., (N tot - L) . (2) 
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These (overlapping) variations of interest rates are expressed again as integers in 
units [b. p. /year]. Our main goal is to study and to compare the distributions of 
51 l if) f° r a U available maturities and for all time lags which we have chosen in 
the range from L = 1 to L = 30 days. 

2.2 Distribution of the Variations of the Interest Rates 

For every variation equal to a given value v in units [b. p. /year] (where v is an inte- 
ger), we can count the number of times this variation occurs in the experimental 
sample. We denote these distributions of variations by N^iv) 

N^iv) = number of times 51™ (t) = v 

= card {t, 5I l r ] (t) =v} . (3) 

By construction, we have iV^ (ti) equal to zero outside the range [v m i n} v max ] 

v min = min{5/[ m] (t)} 

v max = maxjwj™ 1 ^)} (4) 

and 

£ Nt ] (v) = N tot -L. (5) 

Obviously, v m i n and v max depend on the maturity [m] and on the lag L but, to 
simplify the formulas, we have suppressed these extra labels. 
At this point, it is worth making two comments. 

1. The opening days are not always consecutive, due to the presence of Sun- 
days and other non working days. We have considered that the definition 
of the lag ignores these gaps. We work with a business time scale rather 
than a physical one. A more careful study could take into account this fact 
and limit itself to consider as belonging to lag one only those days which 
are non separated by one or more missing days. In order to keep our useful 
data as large as possible we have chosen to ignore this difficulty A more 
refined discussion could obviously take this into account at the price of a 
lowering in statistics. With high frequency data, the definition of a time 
scale that models the market activity is even more crucial 

2. For lags greater than one, the distributions we have defined take into ac- 
count overlapping periods. For example for lag two, one could have defined 
two distributions by using non overlapping two days periods: the distribu- 
tion of the even days and the distribution of the odd days. The number 
of points in these two distributions is divided by two (for lag L, it would 
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be divided by L) decreasing the statistics (in a too drastic way when L is 
larger than four or five days for our limited sample). This would however 
enable one to study autocorrelations and volatility propagation. In [27], 
the authors made a precise study of these overlapping effects, varying the 
overlapping intervals. Their results were never systematically affected by 
these variations. With overlapping periods, the increments are clearly not 
independent. However, we will assume that they are identically distributed 
at a given time scale L. This unconditional approach is motivated by the 
fact that extreme events occur infrequently and do not seem to exhibit time 
dependence [|J. 

The empirical discretized density function fi (v) of the variations of the 
interest rates v for every maturity [m] and every lag L is defined by 

;h(*h^). ( 6) 



Again, /| mJ (v) is defined for integer values of v, zero outside the relevant [v min , v mo r 
range and normalized 



V 



E fr\v) = i. (?) 

The analogous continuous densities f(v) are continuous functions of the con- 
tinuous variations v. We chose [b. p. /year] again as the unit of v. The correspond- 
ing unit for / is the inverse [year/b.p.]. They are normalized in such a way that 
their integral be equal to one. For the experimental discretized distributions @, 
the normalizations are equivalent to stepwise integrals 

l-Vmax , , '"max 

Normalization data = / ft\v)dv = £ ^\v)) = l. (8) 

u — u min 

Obviously, for continuous variations v, the density /| m '(f) takes the constant 
value fi^(v) in the interval of length 1 centered around the discretized variation 
v. Precisely 

ft ] (v) = fV(v) (9) 

for 

v - -) < v < (v + -) . (10) 



2/ _ V 2, 

Outside the [v m i n , v max ] range is / taken to be zero so that the limit in the integral 
can be taken to be — oo and +oo. 

The behaviour of these quantities under changes in the units is outlined in 
Appendix A. 
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The moments of order two of the distributions are again defined by a step- 
wise integration. To a very good approximation, since the mean values of the 
distributions are essentially zero, these second moments can be identified with 
the variance 

Variance data ee H v 2 f [ [ n \v)dv = £ (v 2 ft ] (v)) . (11) 

J CO r _ V 7 

These variances (obtained in units [b. p. /year] 2 ) are obviously finite by construc- 
tion. We however hope that the true variance, which is in principle less securely 
computed from the data than the normalization (R), is correctly estimated by 



the discretized computation ([11]) . In other words, we conjecture that the true 



distribution is of finite variance contrary to some theoretical distributions like 



the Levy distributions which have been proposed in the literature (see [22|, |p4 |, 
0) and can lead to an infinite variance. 

Later in the paper, we will discuss the variance and will also comment on the 
higher moments (skewness, kurtosis) of the distributions. 



2.3 General Comments 

We expect that the distributions of the variation of interest rates in the contin- 
uous variable are smooth functions of v and hence should be approximated by 
(normalized) continuous functions. These distributions are essentially maximum 
for a variation v = and decrease on both sides i.e. in the directions v — > +oo 
and v — > — oo. We also expect that the distributions are, in first approximation, 
symmetrical around v = and hence even functions. These conjectures turn out 
to be very well substantiated by the experimental facts. As we will see however, 
we have decided to allow for a very slight shift s in the axis of symmetry. The 
distribution will then enjoy the symmetry (v — s) <->• —(v — s). The extra param- 
eter s will turn out to be practically zero (less than a few [b. p. /year] and rarely 
different from zero at conventional confidence level, see Table 2 and Table 3). In 
the next section we will give, in a more precise way, our notation and our basic 
hypothesis. 



3 Pade Approximants 

In this section, we fix the notation which will be used for the Pade parameters. 
We also discuss, for the approximants we are going to use, their normalization, 
their variance and their positivity. 
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3.1 Presentation of the Pade Approximants 

Pade Approximants p\- M ' N ^{v) are rational functions of a continuous variable v of 
the form 

p[M,N] / \ = T 0) (12) 

and are indexed by the degrees M and iV of the polynomials T M and B N ap- 
pearing in the numerator (Top) and the denominator (Bottom) of the rational 
fraction P. 

These rational functions are used to obtain good approximations to the con- 
tinuous distributions ([§]) defined in the introduction. Since the distributions are 
real, we choose the polynomials T and B to depend on real parameters only 

As a general requirement, we have to limit ourselves, at the end, to values 
of the parameters of the Pade Approximants such that the densities are purely 
positive. Indeed, even extremely small negative probabilities in the tails would 
not make much sense and should be excluded. 

As we have insisted on in the introduction, we expect that the Pade Ap- 
proximants (and hence the two polynomials T and B) are symmetrical under a 
left-right symmetry around an axis shifted at most slightly (by s) from the v = 
axis. More precisely, we will postulate that the two polynomials are even func- 
tions of the variable (v — s) i.e. functions of (v — s) 2 and hence that M and N 
are even numbers. 

We have also commented about the fact that the Pade Approximants should 
be normalized 

/+oo 
P [M ' N] (v)dv = 1 (13) 
-oo 

and have a finite variance 

/+oo 
pW> N \v)(v - s) 2 dv < oo. (14) 
-oo 

The expectation (mean value) of the Pade distribution is simply 

/+oo 
P [M ' N] (v)vdv = s. (15) 
-oo 

This mean value will turn out to be extremely small in absolute value for our 
experimental samples. 

The two general requirements (jl3|) and ([14]) imply that the even degrees M 
and iV in ([12]) cannot be chosen arbitrarily. We need 

M + A<N. (16) 

On the other hand, though our experimental sample is reasonably large we 
do not wish to introduce too many parameters in the fit. Indeed 
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• It is known that for many classes of functions even low values of M and TV 
will give very good fits 



• We also expect that the resulting distribution should be rather smooth. 
Numerical minimizations with too many parameters will push the routines 
to mimic fake oscillations which are the result of the fact that the sample is 
of finite size. These oscillations do not correspond to any intrinsic structure. 

Hence we have chosen to limit ourselves to iV = 8 and hence from ([16|) to M = 
4. The relevant Pade Appriximants belong to the categories [0, 4], [0, 6], [0, 8], 
[2,6], [2,8], [4,8]. They can all be written at once in the form [4,8]. Restricting 
oneself to some parameters equated to zero will produce the other forms. If we 
allow for infinite variance, the Pade [0, 2], [2, 4] and [4, 6] could also be considered. 

We will now write the general case in a completely explicit form. This allows 
us not only to fix the notation but also to profit from well-known integration 



formulas for ratios of polynomials [30 



To parametrize the numerator of the Pade function, it is convenient to intro- 
duce a complex polynomial U 2 (v) of second degree 

U 2 (v) = u (l + iut(v -s) + u 2 (v - sf) (17) 

and write the numerator of P^ 4 ' 8 ! as 

T\v) = (U 2 (v)YU 2 (v) 

= u 2 (l + (u 2 1 + 2u 2 )(v-sf + u 2 2 (v-sf) 

= n + n 2 (v - s) 2 + n 4 (t> - sf (18) 

which depend on three real parameters Ui, i = 0,1, 2. When M is decreased to 2, 
u 2 = and when M is decreased to 0, both u\ = u 2 = 0. 

For reasons which will be explained shortly, it is even more convenient to split 
the denominator B 8 (v) of the general Pade pl 4 > 8 l ( |T2"D in the form of the modulus 
square of the complex polynomials Q A {v) of degree 4. 

Q\ v ) = 1 + i qi (v - s ) + q 2 ( v - s ) 2 + i q3 ( v - s f + q 4 ( v - s ) 4 (19) 

where the four parameters qi,i = 1, ... ,4 are real. We then obtain the purely 
real eighth's degree denominator 

B\v) = {Q\v)YQ\v) 

= 1 + {q\ + 2q 2 ){v - s) 2 + {q 2 2 + 2q 4 - 2 qi q 3 ){v - s) 4 

+ {(ll + ^2qA){v-sf + ql{v~sf 
= 1 + d 2 {v - s) 2 + d 4 (v - sf + d 6 (v - sf + d 8 (v - sf . (20) 

It is clear that the term of zero degree in this denominator has been chosen to 
be equal to 1 without any loss of generality. When N is decreased to 6, q 4 = 
and when N is decreased to 4, q± = q 3 = 0. 
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Since the form of the Pade we have chosen, which is an even real function of 
(v — s) only, is the modulus square of a complex quantity, the positivity of the 
distribution is guaranteed whatever be the parameters chosen in U 2 and Q 4 . 



3.2 Normalization, Variance and Positivity 

The form (BO) may be thought, at first sight, to be unduly complicated. In fact, 
it has a very important advantage. Indeed, the normalization and the variance 
can be computed analytically ||30|| . First, the normalization ([13|) is 

it ,. , • (Q2Q3 - Qiq^riQ - q 3 n 2 + qin A 
Normalization p a dc[4 81 = 71 5 5 . (21) 

gig2?3 - qtqi - qi 

Imposing that this normalization be 1 decreases the number of arbitrary param- 
eters by one. 

From the expression (|2~lj ) valid for the P^' 8 \ the normalization of the Pade 
with lower M and N can easily be deduced by suitable limiting procedures. Since 
we will mainly use pi 0,4 ! and p[°> 6 \ let us quote explicitly these particular cases 

Normalization p a de[0 6l = — 71 — — — — • (22) 

and 

Normalization p a de[0 4l — 71 — (23) 

q 1 



which for a normalized p[° ,4 l determines 



n as 



^0 = - • (24) 

7T 



The variance ([14]) is 



x j . -9394^0 + q\q^n 2 + (q 3 - gig 2 )^4 , OKA 

Variance Pa de[4,8] = tt 5 T . (25) 

q^iq^qs - q{q^ - qt) 

For pi 0,6 ! this reduces to 

n 



Variance Pad e[o,6] = ?r- 

£3 - qm 



and for ptM to 

n 



(26) 



Variance Pad e[o,4] = -7T 



9l92 

1 



q-2 



(27) 



s 



where the last line in the equation refers to a normalized Pade [0,4] 

To be complete, let us also give the expression for the kurtosis. Obviously, in 
order to have a finite kurtosis, the Pade have to be restricted to p[°< 6 \ p[°> 8 l and 

p[2,8]_ 

v , ■ qiqAn + (q 3 - qiq 2 )n 2 

KurtOSIS Pade[2,8] = = ^ -— 5 ■ ( 28 ) 

qM\q*lz - <?i<?4 - q£) x Variance 

The excess kurtosis is given by subtracting 3 to the kurtosis. The kurtosis for 
p[0,6] rec l uces to 



, . i gi(g3 - ql) 

KurtOSIS p a de[0,6] = 

7T q 3 n 

qxqi 
qs 



(29) 



The last line is for a normalized pi ' 6 !. 

Let us end this section by some technical points 

• First, let us state again that the positivity of the distribution follows triv- 
ially from our writing of the Pade function in terms of the modulus square 
of the ratio U 2 /Q A . 

• Let us note that, in order to obtain the values of the normalization, the 
variance and the kurtosis starting from the formula given in the literature 
(see for example f30|), an analytical continuation has been performed pass- 
ing from real to complex parameters in Q. A short discussion in given in 
appendix C. We have checked, in each instance, that the conditions allowing 
the analytical forms are satisfied. 

• Technically, Pade Approximants, though they often approximate data very 
well and enjoy numerous properties are not "robust", in the following sense. 
When the powers M and iV are changed, the optimal values of the Pade 
parameters approximating in the best way the data (see (^) below) can 
vary widely. In other words, the optimal value of the parameters for a given 
choice of M and iV do not predict, in general, in a very simple way, the 
values of the parameters for other choices of M and N. We refer the reader 
to the relevant literature about this point, which has been studied widely 



However, for a given value of M and N, the parameters entering the Pade 
vary smoothly with respect to the lag and to the the maturity. Hence 
the values for different lags and maturities can be compared and fitted 
extremely well to simple forms. 

As we will show, the normalized experimental distributions f(v) ((|5|), (0)) will 
be extremely well approximated by the normalized p[° ,4 l Pade Approximants. 
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4 Criteria, % 2 



In order to estimate the parameters appearing in the Pade Approximants, a 
measure of the distance between the experimental distribution and the Pade 
distribution should be chosen and minimized. 

We have decided to perform a least square fit by defining for the (non- 
normalized) distributions N (see (|J),(H)) a x 2 

*• = (*W*> -"(*>>' (30) 

v=v min -R 

which is discussed in the items below. 

Minimization of the \ 2 will be used to determine the estimates of the param- 
eters rii and qi entering in the Pade Approximants. In the formula (0), we have, 
for simplicity, suppressed all the indices referring to the lag L, the maturity [m] 
and the Pade indices [M, N] . 

Let us explain and comment on this formula as applied to our case 

• -Pbare(^) is the relevant Pade Approximant computed at the relevant position 
but normalized to the total number of data points instead of one. The 
criteria itself is a pure number and hence independent of the unit chosen 
to make the computations. If the bare N(v) are arbitrarily multiplied by // 
both the bare P's and the cr's should be multiplied by the same \i. 

The normalized Pade Approximant P(v) is thus given in terms of the bare 
Pbam(v) as 

m = p^tfi (31) 

in units [year/b.p.] for the variable v expressed in units [b. p. /year]. 

• The sum runs on the integer discrete integer positions v (in units [b. p. /year] 
which we have chosen to be our natural binning) extending at least from the 
first instance v m i n where N(v) is non zero to the last position v max where 
it is non zero. The inclusion of the integer R > in (^) allows one to 
take into account in the summation an extended range of v where we know 
experimentally that N(v) is zero. Evidently, the summation in (|30"|) could 
or should in principle be extended to the full domain [— oo, +00] . Outside 
the initial range [v m i n ,v max \, N(v) is zero and P(v) is small but non zero. 
In fact, restricting oneself to the range [v min — R, v max + R] does not change 
the fits very significantly. For large absolute values of v, many bins have 
zero observations. For this reason, the empirical characterisation of the 
probability laws is sometimes performed with cumulative distributions (see 
0). In order to keep the implementation as simple as possible and to avoid 
numerical integrations, we have decided to deal with probability densities. 
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In order to make sense we have also to chose the "errors" a(v) in the data in 
a suitable way. Normally, in the absence of knowledge on the experimental 
errors on the N(v), it is customary to chose all the a(v) equal to the same 
value. Indeed, the "experiment" is a one time experiment and cannot be 
reproduced at will. 

However, in our case, we think that we can argue that we have some indica- 
tions on the errors. If the interest rates were purely statistically produced, 
we would expect, in first approximation a random distribution around the 



theoretical curve and a{v) for given v to be of the order of JN(v), i.e. 



if N(v) ^ then a(v) = \JN(v) . (32) 

This is not exactly the case but seems to be a reasonable first guess. Small 
variations about this value, due to the non randomness of the data are 
obviously not important and will influence very little the determination 
of the parameters obtained by minimizing the x 2 - When N(v) = 0, the 



appearance of JN(v) = in the denominator does not make sense. We 
then naturally chose cr(t>) = 1, for the v such that N(v) = (the minimal 
error on an integer number) 

if N(v) = then a(v) = 1 . (33) 

In the following section, we apply the ideas of the two last sections to our 
data set. 



5 Data Analysis 

5.1 Presentation and Statistical Description of the Data 

The raw data we have used are the American daily spot interest rates for constant 
maturities equal to one, two, three, five, seven, ten and thirty years 

H = [1], [2], [3], [5], [7], [10], [30] (34) 

between February 15, 1977 and August 4, 1997. Altogether 7 rates for each of 
the 

N tot = 5108 (35) 
opening days considered. They are calculated from bond prices, published by the 



Board of Governors of the Federal Reserve System 1 1] in the United States of 
America and freely available on their Web site. 

In Table 1, we have given some basic statistical results which can be obtained 
directly from the data : in particular the mean , the variance, the skewness and 
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the kurtosis for all available maturities and for lags L = 1,5, 10, 15, 20, 25, 30. We 
have restricted ourselves to rather small time scales (up to thirty business days). 
For longer periods, the sizes of the samples become too small and the mean values 
cannot be neglected safely. All the variations in interest rates exhibit the same 
general behaviour : increasing variance and decreasing leptokurticity for larger 
lags. The small assymmetries in the data are obviously not reproduced by the 
even Pade Approximants that we have selected. 

5.2 Generalities 

Taking into account our general consideration given above, the precise x 2 we have 
chosen to minimize is 

2 _ ^ R (iW*>) - ^)) 2 

v=v min -R e ^ i 

where N e (v) is defined as N(v) when it is non zero and 1 otherwise. 
Let us call f e (v) the ratio 

Uv) = (37) 

^tot — L< 

which is equal to f(v) (see (|J)) when it is non zero but equal to l/(N tot — L) 
when f(v) =0. 



The final form of the criteria is 



2 



v max +R (P(v) - f(v))' 

X 2 = (N tot -L)x £ V ' ■ (38) 

v=v min -R Je\ v ) 

5.3 Fits with Pade [0,4]. Estimation of the Parameters 
for all the Values of the Lags and Maturities 

5.3.1 Normalized Pade [0,4] 

We have computed the best fit values of the parameters of a Pade P^'^^v), defined 
in flilf) , (|20f) and (|19D but limited to non zero no,?i,?2- The x 2 dH) has been 
minimized with a normalized Pade (no = ^i/vt) (see (pH|)). 

To that effect, we have used the Fortran IMSL DUNLSF minimization pro- 
gramme based on a modified Levenberg-Marquardt algorithm and a finite differ- 
ence Jacobian. The best values of the parameters s, q% and g 2 (in respective units 
[%/year], [century], [century] 2 ) and of their "one parameter standard error" As, 
Aqi and Aq 2 (in the same units) (see appendix B) have been computed for all 
values of the lag and of the maturity: altogether 210 different cases. 
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The results are given in Table 2 for all maturities but for lags L — 1, 5, 10, 15, 20, 25, 30 
only, to keep a reasonable size for the table. The other values can be obtained 
from the authors. 

Some remarks can be made at this point: 

1. For all the cases, including those which are not reported in Table 2, except 
for the [to] = [2], L — 1 case, the hypothesis that the distribution is a Pade 
[0, 4] cannot be rejected at the 5% confidence level. The exception is due 
to the fact that the experimental curve has an oscillatory behaviour around 
its maximum with three separate peaks. This is atypical as compared to all 
the other cases. We conjecture that this is due to a statistical fluctuation 
and does not correspond to a genuine and reproducible effect in the two 
years maturities. 

2. The modulus of the shift parameter | s | is always very small and can 
essentially be neglected. Indeed, it never exceeds a few [b. p. /year] and 
in most cases, it is not statistically different from zero. This is a simple 
reflection of the fact that interest rates themselves follow cycles which come 
back to their original value. 

This means that, in first approximation, the data are symmetrical under 
the (v ) <-> (— v) exchange, as was foreseen and that the influence of s can 
be safely forgotten. Both the normalization and variance can, well within 
the experimental errors, be equated to the values obtained with s = 0. 

3. For all values of the maturities, the two relevant parameters (and also Hq 
which is by normalization a dependent parameters), q\ and q2 decrease very 
smoothly as a function of the lag, including the values of the lags which are 
not in Table 2. This is a simple reflection of the fact that as the lag becomes 
higher the distribution flattens. To a very good approximation, these de- 
creasing functions are straight lines in a log-log plot. We will estimate the 
parameters of these straight lines in Section 5.4. 

4. As a function of the maturity, the parameters first decrease then become 
higher showing a dip. Hence, the density function shows, first a tendency 
to flatten but becomes sharper when very long maturities are concerned. 
Experimental variations of the interest rates for the two and three years 
maturities seem to be somewhat less "smooth" than for the one year matu- 
rity. In turn, densities for maturities of 30 years are remarkably stable and 
strongly peaked around zero variations. 

5. Even though as a function of lag and maturity together, no very simple 
pattern emerges, the smoothness of the parameters in the separate variables 
is a very welcome fact. 
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6. We have chosen to draw, in Figure 1, the experimental distribution together 
with the one given by the best Pade for the maturity [m] = [1] and the 
lag L = 1 and in Figure 2 the case [m] = [1] and L = 30. In both 
cases, the agreement between the two curves is rather good. The quality 
of the fit reflects the low value of the criteria equivalent to a goodness of 
fit of about one. All the analogous figures for the 210 cases show the same 
agreement. In one case only, the [m] = [2], L — 1 case, the agreement is 
worse, reflecting the higher value of the criteria and a low goodness of fit. 
The related Figure 3 shows the reason for this fact. Instead of presenting one 
maximum, the experimental curve exhibits an oscillatory pattern around 
zero. The density of variation of interest rates is strongly peaked at v = 2, 
v = and v = —2 instead of as is the normal situation. 

7. Though the Pade distributions do not have the properties of the Levy dis- 
tributions, they enjoy with these distributions narrower peaks around their 
maximum and longer tails as compared to the Gaussian distributions (see 
below). Compared to the Levy distributions, they have the advantage of 
being expressed in a very simple rational form. As we have seen, many 
auxiliary computations can be done very easily and often analytically. 

5.3.2 Normalized Pade [0,4] with Constrained Variance 

The estimations of the parameters q<i allows, in principle, to evaluate directly 
the variances (|27|) . It turns out that the Pade variances do not always equate 
the values obtained directly from the data. As one can verify directly from 
Table 1 and Table 2, there is no systematic pattern in the deviations : for short 
maturities (1, 2 and 3 years) and large lags, the Pade variances underestimate 
the sample variances. The adequacy is good for intermediate maturities. For the 
longest maturity (30 years), the Pade variances overestimate slightly the sample 
variances. 

These deviations are difficult to interpret. We have thus redone our computa- 
tions using the parameter q2 constrained to —1/ [Sample Variance). Even if the 
minimal values obtained for the criteria are higher than in the non-constrained 
cases, the value of the criteria remains fully acceptable except again in the 
[m] — [2], L — 1 case which produces a too high y 2 . Again, this exception 
is due to the fact that the data for [m] = [2], L = 1 are atypical and that the 
experimental curve exhibits more than one maximum. 

The parameters s are again rather small and, in most cases, not statistically 
different from zero. The values of the parameters s, q\ and of x 2 an d of their 
standard errors are given in Table 3. We have restricted ourselves again to the 
lags L = 1,5, 10, 15, 20, 25, 30 to keep the table of reasonable size. For the other 
lags, the parameter g x interpolates smoothly. 
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5.3.3 Comparison with the Gaussian Distribution 



In order to compare the two cases corresponding to fits of Pade Approximants 
(with the constrained normalization and the variance constrained or not), we 
have performed the same computation for a Gaussian distribution of the form 

G(v) = -/^e^ (39) 



with two free parameters a and s for every lag and for every maturity. The 
Gaussian distributions lead to high \ 2 as compared to the Pade Approximants. 
For the non-constrained estimation, the hypothesis of a normal distribution is 
always rejected for lags up to five days and, in most cases, for the [m] = [30] 
maturity. The no-rejection of the Gaussian model for some large lags is linked 
to a decrease in the leptokurticity with increasing time scale. These results are 
coherent with similar observations on the FX market (see |27| ) . If one constrains 



the standard deviation a to its sample value, the normality is strongly rejected 
for all [m] , L cases. 

As an example we have drawn in Figure 4 the curve for L = 1 and [m] = [1] 
compared to the experimental points. This is obviously worse than the corre- 
sponding comparison for Pade [0,4] given in Figure 1. The Gaussian distribution 
is clearly wider for low v and drops more sharply for higher | v | than the data 
points. The Pade curve is narrower for low v but drops less sharply for higher 
| v | and hence fits the data much better. All the [m] , L cases present the same 
behaviour. 



5.4 Fits with a Normalized Pade [0,4]. Variation of the 
Parameters as a Function of the Lag for Different Ma- 
turities 

The parameters g[ m '(L) and q^\L) at their best value are simple functions of 
the lag L. More precisely, in a log-log plot, we can approximate them by a linear 
model of the form 

Hl [ r\L)) = Ai m] ln(f ) + v [ ; n] 
H-^\L)) = ASr ] ln(f ) + ut ] . (40) 

The justification of the arbitrary factor Lq which will be taken equal to 15 is 
given in Appendix B where a precise discussion of the related standard errors is 
given. 

The q\ m \L) are expressed in some units (see Appendix A). If these units are 
changed, A[ m ' does not change while vf^ picks up, obviously, an additive constants 
(the logarithm of the ratio between the two units). 
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For the estimation of A| and a\ , we have to define new x 2 (C| ) f° r every 
gi and every maturity [m]. This takes into account the errors we have obtained 
on the qi. They are 
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cp = 



i' 1 '" 1 I I \ V'" 1 \">\, i < ' 



L=l 



(Agr(L)) 



2 



C ml = ^ '—. (41) 



L=l 



In Table 4, the values of the best A^ and v\ minimizing C\ and their 
one standard error AAj"^ and Az/j" 1 ^ are given for all maturities. In Figure 5 
and Figure 6, the straight lines ( flOD are plotted for [m] = [1] and compared to 
the data points. The fit is clearly excellent for ln(gf) and somewhat worse for 

H-i ] ). 

The parameters A 2 are related to the so-called "scaling laws" for the time 
dependence of the standard deviations 

a [m] (L) = k^L Elm] . (42) 



This behaviour is reported for most of the financial time series (see ||25|| , |P-3|| , 
[ P7| ) with an exponent typically close to but larger than 0.5, the value which 
should be observed for a Gaussian process. With our notations £?H = -A^ 1 / 2 - 
The absolute value of A 2 , statistically greater than 1, lead to scaling exponents 
ranging from 0.52 to 0.54. 



5.5 Discussion of Fits with Pade Approximants with In- 
dices higher than [0, 4] 

Since the Pade Approximants with indices [0, 4] lead to such good fits to the data, 
it is tempting to see if these fits can even be made better by using higher indices. 
We have tried to do so and have come to the conclusion that [0, 4] is close to the 
optimum. 

1. First, the Pade Approximants [M, N], when the condition of the existence of 
a finite variance is not satisfied fli~4]), do not seem to be very probable. Those 
with M=N clearly are not convenient as they cannot even be normalized. 

2. When the condition of finite variance is satisfied (14), Pade with indices 



higher that [0, 4] obviously produce somewhat better fits and hence lower 
values for the \ 2 - But this statement has to be qualified by considering in 
turn the different possibilities. 
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(a) When [0, 6] is used, the parameter g 3 , new with respect to our first 
choice, is significantly different from zero in few cases only. The x 2 is 
not improved in most cases. This means that the [0, 4] approximation 
is already able to cope with the decrease of the experimental density 
function. A v~ A is sufficient to achieve it. Our careful, but, may be, 
slightly prejudiced analysis of the data even seems to point to the fact 
that the v ~ 4 decrease is the fastest which can be allowed by the data. 
Indeed, the fit to the variance (—1/(72) coming out of the minimization 
already tends to come out too small as compared to the raw variance 
of the data, for maturities up to three years. Remember that this 
raw variance depends crucially on the end points of the experimental 
distribution and hence is not determined very precisely. For pi 0,6 !, the 
moment of fourth degree (numerator of the kurtosis) then becomes 
finite which may be interesting theoretically though to repeat, we be- 
lieve that the data point in the infinite kurtosis direction. The same 
findings were achieved on the FX rates in [J/J. 

(b) We have also tried to fit the experimental data with the other Pade's 
([0, 8], [2, 8], [4, 8] and [2, 6]). In the majority of the cases, no significant 
improvement of the x 2 is obtained. In some cases, the minimization 
algorithm fails [2, 6]. In a few cases our minimizations lead to a smaller 
value for the criteria. But, the price to pay is that the Pade curve 
starts to try to fit small oscillations in the experimental curves [0,8], 
[2,8] and [4,8]. These oscillations are obviously due to the finiteness 
of the sample and do not very likely correspond to intrinsic structures. 
Moreover, the minimum obtained are usually very sensitive to the 
starting values and the minimal parameters do not exhibit the stability 
they show in the [0, 4] case. 



6 Conclusions 

In this paper, we have presented and discussed phenomenological fits of the daily 
variations of the term structure of interest rates (taking as an example those 
published by the Board of Governors of the Federal Reserve System [|11||). We 



have shown that simple Pade Approximants, which are theoretically well suited 
for approximating in a rather smooth way many classes of functions, fit the 
experimental results amazingly well. 

The best values of the parameters defining the Pade Approximants vary rather 
smoothly as a function of the lag and of the maturity. In particular, as a function 
of the lag, in a log-log plot the parameters are very well represented by straight 
lines. This can be related to scaling laws reported for other financial time series. 

We have also shown that, the simplest Pade Approximant pt ' 4 !, which de- 
pends essentially on only two meaningful parameters, has enough richness to 



17 



represent the data faithfully and smoothly. The extension to higher Pade Approx- 
imants (for example to pi ' 6 !) is by no means necessary. The Pade Approximants 
have then a tendency to mimic small oscillations in the data. These oscillations 
are clearly the results of the finiteness of the sample and do not seem to be related 
to any "real" underlying structure. 

Since Pade Approximants are simple rational functions, they can be easily 
written, tabulated, and, once the best parameters have been obtained, used for 
further applications. We hope to come back to this idea in the near future and 
show how these explicit results can be used for estimating risks related to extreme 
events. 

A Dimensions of the Pade Parameters 

A short appendix about units and dimensions is useful here. Interest rates have 
the dimension of the inverse of time, [[t]] _1 , are usually given in [%/year] with 
exactly two meaningful digits and hence are expressed in [b. p. /year] as an integer 
number. 

• This makes [b. p. /year] = 10 _4 /year a very natural unit to use both for 
the interest rates but even more for the variation in the interest rates. It 
gives a natural binning for the discretized distributions. The inverse unit 
of [b. p. /year] is [year/b.p.] which is the time scale of 10 4 years. This is the 
unit we have consistently used in the main body of the paper. 

• In the tables and in the figures, the different parameters as well as the 
means and the variances are expressed as powers of the more conventional 
unit [%/year] and of its inverse [year/%] which is a [century]. They are 
units of well adapted sizes. 

• In the following table, we quote the dimension [[x]} of the quantities x we 
have been using, the units [x] we have been using in the main part of the 
text and the units we have used in the tables. 

Table of Dimensions and Units 
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• It is easy to pass from variables expressed in one unit to the variables 
expressed in the other unit by using the correspondence 

[%/year] 
[b.p./year] = — 

[year/b.p.] = 100 x [year/%] = 100 x century. (43) 

• For example, the value of expressed in units [year/b.p.]* becomes 100* x q i 
in the more customary unit [year/%]* while n» becomes 100* +1 x n^. The 
Pade density P(v,rii,qi) (where we have written explicitly the parameters 
(p~8|) , (|19|) ) in units [year/b.p.] for v expressed in [b.p./year] is related to the 
density p new [y^ where P new is expressed in [century] in terms of v expressed 
in [%/year] by 

P new (v) = P(w; 100 m n 4 , 100*gi) • (44) 



B Discussion of the Errors 

A short discussion on the determination of the errors is worthwhile giving. The 



X 2 ((38) or (41)), depend on the Pade or on the A, v parameters. Let us call these 
parameters generically p = {p± , P2 , ■ ■ .}. The best values of the parameters p best 
are obtained by minimizing the criteria x 2 (p best ) = Xmin- The confidence regions 
in one parameter with confidence level p = 68.3% are obtained for the contour 

Ax 2 = X 2 (p)-xL n = l- (45) 
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For errors on two parameters jointly, the contour is given by 



A X 2 = 2.3. (46) 

From the computation of the matrix H of the second derivatives (the Hessian) 
of the Criteria \ 2 & t its minimum and from the Taylor expansion of x 2 {p) (to 
second order in the parameters) one finds 

A X 2 = iffyfo - pf^fo. - pf st ) . (47) 

• If this ellipsoid has its principal axis reasonably aligned along the parameter 
axis (which is the case for the parameters we have chosen as can be seen 
on Figures 7, 8 and 9) one can define the "one parameter standard error" 
Api, for a given parameter as 

A Pi =\ Pi -pt est \=^-. (48) 

Notice that the classical interpretation of standard errors is only valid if the 
measurements errors are normally distributed. If they are not, a suitable 
contour of constant \ 2 ((0)) (EH)) should be used as boundary of the 
confidence region (see f29|) for a more precise discussion). 



If this ellipsoid does not have its principal axis reasonably aligned along 
the axis, the "one parameter error" is obtained from the maximal M, and 
minimal values of the fixed parameter pi in the region A\ 2 < 1 as 

Mi—rrii . , 

Ap, = - . (49) 



It should be remarked that for (|4lD , the (obvious) value Lq = 15 is close to 
the optimal value which makes the principal axis of the ellipse parallel to the 
parameter axis. 

As examples, the curves corresponding to A% 2 = 1 and A% 2 = 2.3 have been 
drawn for the parameters qi, q2 in the [m] — [1], L — 1 case in Figure 7, for the 
parameters Ai andz/x for [m] = [1] in Figure 8 and for the parameters A2 andz/ 2 
for [in] = [1] in Figure 9 . 



C Analyticity of the Denominator 

The Normalization (^l|), the Variance (^) and the Kurtosis (^) have been com- 
puted analytically from the form we have given for the denominator of the Pade 
Approximant (|T9"D. There is a restriction for these formulas to be valid, namely 
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that the poles of Q n {y) be all situated in, say, the upper-half plane of the complex- 
ified variable v. All the poles of Q n *(v) are then situated in the lower half-plane. 
If the poles have complex values Vj = s + cij + ibj, j = 1, ... ,4, with bj > 0, 
Q n {y ) is written 

n v — s 

or(v) = u(i+i—). (so) 

i=i v i 

Expanding the products and taking account of the positivity of the bj, it is 
easy to see this implies for Q 3 the necessary conditions are 

qi > 0, q 2 < 
<?3 < 0, q 3 - q x q 2 > . 

We expect that these conditions, for Q 4 , are supplemented by the conditions 

<?4 > o, q 2 q 3 - qm > 

q^qs ~ q&4 ~ q\ > 

Moreover, written in terms of the poles, we have verified that all the analytical 
forms of the Normalization, of the Variance and the Kurtosis lead to positive 
values and hence are fully meaningful. 
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Table Captions 

Table 1 

Table of Univariate Statistics for the daily changes of the American Spot 
Interest Rates (expressed in the relevant [%/year] unit) between February 
15, 1977 and August 4, 1997. The statistics are given for all available 
maturities [m] = [1], [2], [3], [5], [7], [10], [30] and for the representative subset 
of lags L — 1, 5, 10, 15, 20, 25 and L = 30. The quantities are expressed in 
the relevant [%/year] unit. See appendix A. 

Table 2 

The optimal values of the parameters q± , q 2 and s appearing in the de- 
nominator of the Pade p[°' 4 l together with their "errors" Agi, Aq 2 and 
As (for the units chosen, see appendix A, for the interpretation of the 
errors, see appendix B). The parameters no are constrained by the nor- 
malization. These optimal parameters are given for all available maturities 
[m] = [1], [2], [3], [5], [7], [10], [30] and for the representative subset of lags 
L — 1, 5, 10, 15, 20, 25 and L = 30. The parameters for the other values of 
the lag have been computed but are not given to limit the size of the table. 
They vary smoothly and have been taken into account to estimate the pa- 
rameters Aj and z/j appearing in Table 4. The minimal value of the Criteria 
X 2 (a pure number) is also given. For all the cases, including those which 
are not reported in the table, except for the [m] = [2], L = 1 case whose 
X 2 is marked with a * in the table, the hypothesis that the distribution is 
a Pade [0,4] cannot be rejected at the 5% confidence level. 

Table 3 

The optimal values of the parameters q\ and s appearing in the denominator 
of the Pade p[°> 4 ] together with their errors Agi, As (for the units chosen, 
see appendix A, for the interpretation of the errors, see appendix B). In this 
table, the values of no are constrained by the normalization and the values of 
q 2 by the experimental variance. These parameters are given for all available 
maturities [m] = [1], [2], [3], [5], [7], [10], [30] and for the representative subset 
of lags L = 1,5, 10, 15,20,25,30. The parameters for the other values of 
the lag have been computed but are not given to limit the size of the table. 
They vary smoothly. The minimal value of the Criteria x 2 ( a pure number) 
is also given. For all the cases, including those which are not reported in 
the table, except for the [m] = [2], L = 1 case whose x 2 is marked with a 
* in the table, the hypothesis that the distribution is a Pade [0, 4] cannot 
be rejected at the 5% confidence level. 

Table 4 
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The optimal values of the A ?1 , u qi , X q2 , v q , 2 parameters and their standard 
errors AA 9l , Au qi , A\ q2 , Au q2 as functions of the maturity (for the inter- 
pretation of the errors, see appendix B). 



25 



Figure Captions 

Figure la 

The experimental distribution compared with the one given by the best 
Pade [0,4] for maturity [m] = [1] and lag L — 1. 

Figure lb 

The difference between the experimental distribution and the distribution 
obtained by the best Pade [0,4] for maturity [m] = [1] and lag L = 1. 
Comparing this figure to Figure la, one sees that, in absolute value, the 
deviation between the data and the approximation is larger in the central 
part of the distribution but with no systematic bias. 

Figure lc 

The ratio R between the experimental density and the best Pade [0, 4] 
density for maturity [m] = [1] and lag L — 1. This figure zooms the 
tails as compared to the central values. One sees that, in relative size, the 
central part is very well reproduced. For absolute values of v higher than 
0.4, the scarcity of the data is reflected by the appearance in the plot of 
the horizontal axis and of curves rising as v 4 . These correspond to the rare 
events N(v) = 0, 1, 2, 3 (see (H)) in the tails of the distribution of variations. 
In this region, the Pade distribution averages the data. 

Figure 2 

The experimental distribution compared with the one given by the best 
Pade [0,4] for the maturity [m] = [1] and the lag L = 30. 

Figure 3 

The exceptional case of the experimental distribution compared with the 
one given by the best Pade [0, 4] for the maturity [m] = [2] and the lag 
L = 1. The experimental points exhibit three very large oscillations in the 
region around v = showed in the figure. We have highlighted this fact by 
arbitrarily drawing the dotted line. These oscillations may be of statistical 
nature as they do not show up in any of the 209 other cases that we have 
studied. 

Figure 4a 

The experimental distribution compared to the Gaussian distribution for 
the maturity [m] = [1] and the lag L — 1. The experimental distribution is 
more strongly peaked than the Gaussian distribution which itself has too 
small tails. 
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Figure 4b 

The difference between the experimental distribution and the Gaussian dis- 
tribution for the maturity [m] = [1] and the lag L — 1. This figure is worth 
comparing with Figure lb. One sees that, in absolute value, the deviation 
between the data and the approximation is large in the central part of the 
distribution but also shows a systematic bias in the region 0.2 <| v |< 0.4. 

Figure 4c 

The ratio R between the experimental density and the best Gaussian density 
for maturity [m] = [1] and lag L — 1. This figure zooms the tails as 
compared to the central values. One sees that, in relative size, neither the 
central part nor the tails are very well reproduced. For absolute values of v 
higher than 0.2, R is seen to shoot up extremely fast. Hence, the tails are 
very badly fitted by a Gaussian. 

Figure 5 

The straight line In ((ft) (see (f40|)) as a function of ln(L/L ) compared with 
the data points for [m] = [1]. The parameters are obtained from the best 
fit with a normalized Pade [0.4]. 

Figure 6 

The straight line ln(— q 2 ) (see (^0|)) as a function of ln(L/L ) compared 
with the data points for [m] = [1]. 

Figure 7 

The ellipses corresponding to A% 2 
parameters (ft , q 2 appearing in the 
qi and q 2 are found in Table 2. 

Figure 8 

The ellipses corresponding to A% 2 = 1 and A% 2 = 2.3 for the errors in the 
parameters Ax, V\ appearing in the linear model of hi((ft) as a function of 
ln(L/L ) in the [m] = [1] case. The estimates of Ai and vi are found in 
Table 4. 

Figure 9 

The ellipses corresponding to A% 2 = 1 and A% 2 = 2.3 for the errors in the 
parameters A2, v 2 appearing in the linear model of ln(— q 2 ) as a function of 
ln(L/Lo) in the [m] = [1] case. The estimates of A2 and u 2 are found in 
Table 4. 



= 1 and Ax 2 = 2.3 for the errors in the 
[m] = [1] , L — 1 case. The estimates of 
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Table 1 

Table of Univariate Statistics 
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L 


1 


5 


10 


15 


20 


25 


30 




Mean 


0.0000 


0.0001 


0.0000 


0.0001 


0.0001 


0.0002 


0.0004 




Variance 


0.0143 


0.0860 


0.1966 


0.3326 


0.4836 


0.6446 


0.8136 


M=[i] 


Skewness 


-0.1541 


-0.8223 


-1.0587 


-1.0710 


-1.1273 


-1.1413 


-1.1594 




Kurtosis 


14.5664 


10.7279 


10.4445 


10.9208 


10.9326 


10.9088 


10.8980 




Vmin 


-1.0800 


-2.2700 


-3.0600 


-4.0700 


-5.4500 


-6.3100 


-6.9100 




Vmax 


1.1000 


2.0600 


2.4700 


3.1400 


3.2000 


3.7200 


4.0100 




Mean 


0.0000 


-0.0002 


-0.0005 


-0.0007 


-0.0009 


-0.0009 


-0.0010 




Variance 


0.0113 


0.0702 


0.1594 


0.2663 


0.3836 


0.5109 


0.6456 


H=[2] 


Skewness 


-0.3648 


-0.7398 


-0.8722 


-0.8171 


-0.8547 


-0.8600 


-0.8741 




Kurtosis 


12.4522 


9.7384 


9.1389 


9.2421 


8.8631 


8.5522 


8.3537 




Vmin 


-0.8400 


-2.0800 


-2.7900 


-3.3700 


-4.4900 


-5.2300 


-5.9600 




Vmax 


0.8900 


1.9900 


2.6000 


2.8700 


3.0900 


3.3900 


3.7100 




Mean 


-0.0001 


-0.0004 


-0.0010 


-0.0015 


-0.0019 


-0.0022 


-0.0025 




Variance 


0.0102 


0.0622 


0.1374 


0.2256 


0.3225 


0.4285 


0.5421 


H=[3] 


Skewness 


-0.1628 


-0.4432 


-0.5977 


-0.5000 


-0.5254 


-0.5487 


-0.5928 




Kurtosis 


10.4136 


7.7748 


7.6880 


7.4571 


6.7799 


6.4504 


6.3316 




Vmin 


-0.7900 


-1.5700 


-2.8300 


-3.0500 


-3.5800 


-4.3200 


-5.1200 




Vmax 


0.9200 


1.9900 


2.6000 


3.0400 


3.3200 


3.6000 


3.6500 




Mean 


-0.0001 


-0.0007 


-0.0016 


-0.0024 


-0.0032 


-0.0038 


-0.0044 




Variance 


0.0091 


0.0544 


0.1182 


0.1901 


0.2661 


0.3491 


0.4409 


M=[5] 


Skewness 


-0.3064 


-0.3938 


-0.4482 


-0.3617 


-0.3448 


-0.3109 


-0.3025 




Kurtosis 


8.7216 


6.2382 


5.9828 


5.9650 


5.5396 


4.8740 


4.4440 




Vmin 


-0.7700 


-1.5400 


-2.3900 


-2.5800 


-3.0700 


-3.6000 


-4.3200 




^max 


0.7200 


1.6400 


2.3600 


2.8200 


3.0800 


3.3800 


3.5100 




Mean 


-0.0002 


-0.0010 


-0.0021 


-0.0031 


-0.0040 


-0.0048 


-0.0056 




Variance 


0.0085 


0.0491 


0.1044 


0.1651 


0.2287 


0.2981 


0.3749 


[m]=[7] 


Skewness 


-0.3066 


-0.3907 


-0.4981 


-0.3602 


-0.3019 


-0.2683 


-0.2637 




Kurtosis 


8.1780 


5.3213 


5.4510 


5.2149 


4.6201 


3.9410 


3.5034 




Vmin 


-0.7800 


-1.3600 


-2.4000 


-2.5500 


-2.9400 


-3.1100 


-3.6200 




Vmax 


0.7000 


1.5300 


2.0300 


2.6000 


2.8000 


3.1100 


3.2600 




Mean 


-0.0002 


-0.0012 


-0.0025 


-0.0037 


-0.0049 


-0.0060 


-0.0070 




Variance 


0.0076 


0.0440 


0.0928 


0.1449 


0.1984 


0.2574 


0.3224 


H=[io] 


Skewness 


-0.2817 


-0.5431 


-0.6719 


-0.4999 


-0.3918 


-0.3246 


-0.3022 




Kurtosis 


6.9688 


5.4452 


5.7715 


5.1414 


4.1992 


3.4414 


3.0903 




Vmin 


-0.7500 


-1.3600 


-2.3500 


-2.6000 


-2.6200 


-2.8400 


-3.3700 




Vmax 


0.6500 


1.3600 


1.8500 


2.3600 


2.5500 


2.8300 


2.9700 




Mean 


-0.0002 


-0.0013 


-0.0027 


-0.0039 


-0.0052 


-0.0062 


-0.0073 




Variance 


0.0062 


0.0349 


0.0723 


0.1114 


0.1518 


0.1972 


0.2475 


[m]=[30] 


Skewness 


-0.2245 


-0.4463 


-0.4792 


-0.3334 


-0.2077 


-0.1437 


-0.1078 




Kurtosis 


6.8619 


4.5314 


4.0574 


3.7200 


3.1661 


2.6781 


2.3830 




Vmin 


-0.7600 


-1.3100 


-1.8800 


-2.1600 


-2.3800 


-2.2000 


-2.4800 




Vmax 


0.5000 


0.9500 


1.3800 


1.6500 


2.0800 


2.3700 


2.4900 
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Table 2 

Parameters for the Normalized Pade Approximant [0,4] 
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L=\ 


L=5 


£=10 


£=15 


£=20 


L=25 


L=30 




x 2 


140.59 


310.90 


446.62 


508.29 


668.08 


660.21 


665.98 




qi 


25.253 


8.633 


5.655 


4.474 


3.648 


3.162 


2.885 


[m] = [l] 


A qi 


0.518 


0.171 


0.110 


0.085 


0.067 


0.062 


0.056 




92 


-70.861 


-16.802 


-9.145 


-5.472 


-4.053 


-2.867 


-2.125 




Aq 2 


4.987 


0.814 


0.369 


0.257 


0.177 


0.130 


0.110 




s 


-0.00071 


0.00549 


0.00968 


0.01143 


0.02594 


0.01579 


0.01005 




As 


0.00078 


0.00227 


0.00331 


0.00419 


0.00495 


0.00584 


0.00654 




x 2 


367.28* 


275.87 


361.33 


453.41 


527.96 


521.65 


534.96 




qi 


24.073 


8.022 


5.139 


3.940 


3.161 


2.749 


2.511 


[m] = [2] 




0.475 


0.163 


0.100 


0.074 


0.058 


0.052 


0.047 






-84.096 


-17.733 


-9.349 


-5.917 


-4.286 


-3.023 


-2.204 






4.744 


0.683 


0.338 


0.227 


0.151 


0.109 


0.086 




s 


-0.00009 


0.00506 


0.01052 


0.01278 


0.02027 


0.01102 


-0.00135 




As 


0.00081 


0.00240 


0.00349 


0.00434 


0.00512 


0.00616 


0.00709 




x 2 


123.82 


272.21 


324.87 


372.72 


504.58 


513.08 


514.94 






24.291 


7.850 


5.099 


3.8345 


3.158 


2.765 


2.528 


[m] = [3] 




0.497 


0.161 


0.100 


0.073 


0.057 


0.051 


0.047 






-89.184 


-18.659 


-8.811 


-5.438 


-4.091 


-2.894 


-2.132 




Aq 2 


4.989 


0.674 


0.309 


0.206 


0.153 


0.107 


0.089 




s 


0.00016 


0.00512 


0.00563 


0.00477 


0.01984 


0.01253 


0.001 




As 


0.00083 


0.00241 


0.00359 


0.00458 


0.00523 


0.00625 


0.00711 




x 2 


130.61 


205.42 


280.57 


330.08 


428.37 


500.15 


501.82 




qi 


24.225 


7.976 


5.295 


4.099 


3.387 


2.988 


2.758 


[m] = [5] 


A qi 


0.510 


0.159 


0.103 


0.079 


0.063 


0.056 


0.051 




92 


-96.702 


-18.222 


-9.425 


-5.844 


-4.391 


-3.092 


-2.313 




Aq 2 


4.700 


0.724 


0.356 


0.233 


0.166 


0.120 


0.103 




s 


0.00006 


0.00206 


0.00659 


0.00236 


0.00124 


-0.00026 


-0.00424 




As 


0.00082 


0.00240 


0.00346 


0.00439 


0.00509 


0.00599 


0.00669 




x 2 


124.42 


170.78 


260.73 


335.07 


339.22 


367.99 


516.64 




qi 


23.745 


8.038 


5.602 


4.355 


3.546 


3.103 


2.945 


[m] = [7] 


A qi 


0.492 


0.161 


0.111 


0.085 


0.066 


0.059 


0.053 




92 


-100.715 


-19.25 


-9.537 


-6.209 


-4.393 


-3.324 


-2.860 




Aq 2 


4.739 


0.751 


0.375 


0.244 


0.174 


0.131 


0.132 




s 


-0.0004 


0.00177 


0.00447 


0.00245 


-0.00041 


0.00133 


-0.00934 




As 


0.00084 


0.00238 


0.00338 


0.00423 


0.00506 


0.00582 


0.00607 




x 2 


137.93 


175.08 


229.76 


319.02 


377.43 


321.47 


499.17 




9i 


24.505 


8.527 


5.815 


4.585 


3.824 


3.291 


3.126 


[m]=[10] 


A<?i 


0.505 


0.169 


0.113 


0.091 


0.073 


0.064 


0.058 




92 


-109.545 


-21.157 


-10.172 


-7.088 


-5.026 


-3.594 


-3.114 




Aq 2 


5.035 


0.825 


0.405 


0.271 


0.193 


0.143 


0.139 




s 


-0.00074 


0.00204 


0.00706 


0.00224 


0.0033 


0.00055 


0.00475 




As 


0.00081 


0.00225 


0.00326 


0.00399 


0.00474 


0.0056 


0.00581 




x 2 


102.78 


166.29 


238.66 


280.98 


337.20 


299.56 


464.49 




9i 


27.439 


9.848 


6.878 


5.481 


4.548 


3.912 


3.656 


[m]=[30] 


A qi 


0.579 


0.196 


0.135 


0.107 


0.087 


0.075 


0.070 




92 


-128.242 


-25.041 


-12.123 


-7.921 


-5.759 


-4.234 


-3.398 




Ag 2 


5.661 


1.028 


0.518 


0.340 


0.257 


0.189 


0.167 




s 


-0.00051 


0.00134 


0.00463 


0.00145 


0.0018 


0.00408 


-0.00445 




As 


0.00073 


0.00199 


0.00283 


0.00353 


0.00419 


0.0049 


0.00525 
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Table 3 

Parameters for the Normalized Pade Approximant [0,4] with Constrained Variance 







L=\ 


L=5 


L=10 


L=15 


L=20 


L=25 


L=30 




A 


140.62 


353.13 


573.23 


590.96 


765.57 


736.57 


719.84 


tail = [11 


yi 


25 262 


8.965 


6.130 


4.807 


3.968 


3.425 


3.049 




A(7i 


0.517 


0.172 


0.112 


0.085 


0.067 


0.061 


0.055 






-0 00071 


00421 


00633 


00564 


0.02711 


00749 


00832 




As 


00078 


00231 


00346 


00442 


00541 


00612 


00682 




A 


368.13* 


305.13 


457.03 


552.25 


660.94 


622.24 


585.35 


tall = [21 


yi 


24.056 


8.163 


5.403 


4.190 


3.395 


2.928 


2.619 




A(7i 


0.475 


0.166 


0.103 


0.075 


0.059 


0.054 


0.049 






-0 00008 


00456 


01033 


01152 


01581 


00845 


-0 00263 




As 


00081 

\J • \J\J \JKJ -L 


00241 


00363 


00469 


00568 


00659 


00740 




A 


126.50 


288.11 


352.29 


398.49 


550.75 


542.31 


525.85 


[ml =[31 

|1I±J |OJ 


yi 


24.260 


7.928 


5.163 


3.905 


3.244 


2.816 


2.550 




A(7i 


0.497 


0.163 


0.102 


0.073 


0.057 


0.058 


0.052 




g 


00013 


00549 


00661 


00538 

\J • \J\J \J *J KJ 


01881 


01162 


00225 




As 


00082 


00242 


00363 


0.00471 


00552 


00644 


00727 




X 2 

A 


137.99 


205.47 


288.40 


336.55 


443.68 


503.88 


502.02 


[ml -[51 

|I11J \0\ 


yi 


24.276 


7.975 


5.328 


4.135 


3.429 


2.998 


2.760 




Aoi 
*— *yi 


0.509 


0.159 


0.103 


0.079 


0.063 


0.056 


0.051 






00005 


00203 


00689 


00251 


00088 


00006 


-0 00402 




As 


00081 


00240 


00349 


0.00444 


00520 


00604 


00671 




x 2 


136.33 


172.98 


260.75 


335.45 


339.23 


368.04 


516.76 


[m]=[7] 


Qi 


23.817 


8.032 


5.601 


4.360 


3.547 


3.102 


2.958 




Agi 


0.490 


0.161 


0.111 


0.085 


0.066 


0.059 


0.053 




s 


-0.00046 


0.00171 


0.00447 


0.00265 


-0.00040 


0.00123 


-0.00896 




As 


0.00083 


0.00237 


0.00338 


0.00424 


0.00506 


0.00581 


0.00614 




x 2 


153.87 


178.63 


231.94 


319.50 


377.44 


325.44 


499.18 


[m]=[10] 


qi 


24.630 


8.527 


5.810 


4.592 


3.823 


3.286 


3.127 




Agi 


0.505 


0.169 


0.113 


0.091 


0.073 


0.064 


0.058 




s 


-0.00077 


0.00189 


0.00722 


0.00234 


0.00328 


0.00012 


0.00480 




As 


0.00080 


0.00224 


0.00324 


0.00400 


0.00474 


0.00556 


0.00581 




x 2 


126.60 


177.63 


248.66 


289.85 


346.74 


317.15 


478.07 


[m]=[30] 


qi 


27.836 


9.876 


6.888 


5.486 


4.536 


3.909 


3.638 




Agi 


0.578 


0.195 


0.135 


0.107 


0.087 


0.075 


0.070 




s 


-0.00060 


0.00109 


0.00430 


0.00123 


0.00167 


0.00275 


-0.00657 




As 


0.00071 


0.00197 


0.00280 


0.00349 


0.00412 


0.00481 


0.00516 
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Table 4 

Parameters of the Pade [0, 4] as a Function of the Lag for Different Maturities 





M = [i] 


[m\ = [2] 


M = [3] 


[m\ = [5] 


M = [7] 


[m] =: [10] 


[m\ = [30] 




-0.663 


-0.666 


-0.660 


-0.623 


-0.604 


-0.594 


-0.582 


AX qi 


0.004 


0.004 


0.004 


0.004 


0.004 


0.004 


0.004 




1.480 


1.363 


1.350 


1.411 


1.459 


1.513 


1.676 




0.004 


0.004 


0.004 


0.004 


0.004 


0.004 


0.004 




-1.054 


-1.070 


-1.085 


-1.076 


-1.054 


-1.046 


-1.052 


AX q2 


0.010 


0.009 


0.009 


0.008 


0.009 


0.008 


0.009 




1.647 


1.684 


1.670 


1.737 


1.803 


1.907 


2.042 


Av q2 


0.008 


0.007 


0.007 


0.007 


0.007 


0.007 


0.008 
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Figure la 

Pade [0,4] Density Compared to the Data 
m] = [1] , L = 1 
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Figure lb 
Density Difference between 
the Data and Pade [0,4] 
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Figure lc 
Ratio R of the Data to the Pade [0,4] 
Densities for [m] = [1], L = 1 
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Figure 2 
Density Compared to the Data 
m] = [1] , L = 30 
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Figure 3 

Pade [0,4] Density Compared to Data 
m] = [2] , L = 1 




Figure 4 a 

Gaussian Density Compared to the Data 
m] = [1] , L = 1 




Figure 4b 
Density Difference between 
the Data and Gauss 
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Figure 4c 
Ratio R of the Data to the 
Gaussian densities for [m] = [1] , L = 1 
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Figure 5 
ln(q x ) versus ln(L/L Q ) 



for [m] = [1] 
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Figure 7 
Confidence Region for q 1 and q 2 



for 



m 



= [1] , L = 1 
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Figure 8 
Confidence Region for X l and V x 



for 
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Figure 9 
Confidence Region for X 2 and V 2 

for [m] = [1] 




